This Raw Listing contains the General Information Section and up to the first 5 pages.



SEQUENCE LISTING

(1) General Information:

    (i) APPLICANT: Anderson, David J.
                   Saito, Tetsuichiro

    (ii) TITLE OF INVENTION: A NOVEL HOMEODOMAIN PROTEIN

    (iii) NUMBER OF SEQUENCES: 22

    (iv) CORRESPONDENCE ADDRESS:
        (A) ADDRESSEE: Flehr, Hohbach, Test, Albritton & Herbert
        (B) STREET: Four Embarcadero Center, Suite 3400
        (C) CITY: San Francisco
        (D) STATE: California
        (E) COUNTRY: United States
        (F) ZIP: 94111

    (v) COMPUTER READABLE FORM:
        (A) MEDIUM TYPE: Floppy disk
        (B) COMPUTER: IBM PC compatible
        (C) OPERATING SYSTEM: PC-DOS/MS-DOS
        (D) SOFTWARE: PatentIn Release #1.0, Version #1.30

    (vi) CURRENT APPLICATION DATA:
        (A) APPLICATION NUMBER: US 08/701,278
        (B) FILING DATE: 22-AUG-1996
        (C) CLASSIFICATION:

    (viii) ATTORNEY/AGENT INFORMATION:
        (A) NAME: Silva, Robin M.
        (B) REGISTRATION NUMBER: 38,304
        (C) REFERENCE/DOCKET NUMBER: A-63770-1

    (ix) TELECOMMUNICATION INFORMATION:
        (A) TELEPHONE: (415) 781-1989
        (B) TELEFAX: (415) 398-3249

(2) INFORMATION FOR SEQ ID NO:1:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 2424 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: unknown



PAGE: 2    RAW SEQUENCE LISTING    DATE: 03/11/97
PATENT APPLICATION US/08/701,278    TIME: 15:44:57
INPUT SET: S16092.raw

        (D) TOPOLOGY: unknown

    (ii) MOLECULE TYPE: DNA (genomic)

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

GCAGAGGTAG GCAGGGTTCC CGAGCCGCTC TCCCGGCTCC CTGCTCTGGG CCTTGGGGCT 60

CCACCGGCTT CTTGGCCCGA GCTGCTGCGC GTGCAGATGG CCTTGCGCGA TCGCCGGACC 120

CCGCTGCGGT GGCCAAGTGC AGGGCTTGTG GCTGGGACCC CTGAGAACCA GGAGCCAGAC 180

TGTGCTCAGC TTGCCAGGCC GGAGCCACGC ACGGGCACAA GTCTGTCAGG CCGCCATCAG 240

TCCTGGTCCA GCCGTCAGGG CCCATCCGAC CGTCGGCGAT GTTTTATTTC CACTGCCCGC 300

CACAGCTAGA GGGCACAGCG CCTTTTGGTA ACCACTCTAC GGGGGATTTT GATGATGGGT 360

TTCTTAGAAG AAAACAGCGC AGAAATCGGA CAACCTTCGC TCTTCAGCAG TTGGAAGCTC 420

TGGAGGCAGT CTTTGCCCAA ACACACTACC CAGATGTCTT CACCAGAGAA GAGCTAGCCA 480

TGAAAATAAA CCTCACAGAA GCCAGAGTGC AGGTTTGGTT CCAGAACCGA AGAGCCAAGT 540

GGAGGAAGAC AGAGAGAGGG GCCTCTGACC AGGAACCAGG GGCTAAGGAA CCCATGGCAG 600

AGGTGACACC ACCCCCAGTG AGGAACATCA ACTCTCCACC CCCAGGGGAC CAGGCCCGGG 660

GCAAGAAGGA GGCCCTGGAG GCCCAGCAGA GCCTGGGACG CACAGTGGGC CCCGCCGGGC 720

CTTTCTTCCC CTCCTGCTTG CCAGGGACCC TCCTGAACAC AGCCACTTAT GCCCAGGCCC 780

TGTCCCATGT GGCATCTCTG AAAGGGGGCC CACTGTGCTC TTGCTGCGTC CCAGACCCTA 840

TGGGGCTCTC CTTCCTCCCC ACTTACGGTT GCCAGAGTAA CCGCACAGCC AGCGTGGCTG 900

CCCTGCGCAT GAAGGCCCGC GAGCATTCAG AAGCGGTCCT GCAGTCTGCC AACCTTCTGC 960

CGTCCACCAG CAGCAGCCCC GGCCCTGCCT CCAAGCAGGT GCCTCCAGAA GGCAGCCAGG 1020

ACAAGCCCTC CCCAACGAAG GAACAGAGCG AGGGAGAGAA GAGCGTATGA GGGTCCGGAG 1080

AACCCAGCTG GGAGCCCTGC CCACCCCTGC TTCTCTCAGC CTCAGCCCTG CCAGCCTCTG 1140

AACCACAAGG AGTAGCCACC TCCTCATGGA TCTGACAGGG CAAACGGGAC CTGCAAGCTG 1200

GTTGAGACCT GAAGAGTCCC TCTAGAATTC TGCTGGTAGG CTGTGTTGTT CTCGCTTTTC 1260

CTTTGGTGAC ATTTTCCGAT GGCTCTTAGT GACTCTGGAC ACTGCTCTGT GATGAGGTCC 1320

CTGTTTTTTG CTTTTTGTTT TGTCTCTTTT TTTTTGTTTT GTTTTGTTTT ATTTTCCAGG 1380 

CCAAGCAGCC TTGGAGCAAA GCAGATTAGT TTATTCCACC ATCCTTCTTG AGATATCTGG 1440 

GAAGGTCTTG TCAATTCCAA GGACTGTGGC AAGGATCATC CGTGAAAGAT GCCAAGAAGT 1500 

GACATCTCAT GACAGGAAAT GAGACGGGCA CTCCCATATT GCTTAAGAAC CACAGAACTG 1560 

GTGGACTATC AGCCAGTTCT CACTCCCTTC AGCCAGGACT GGCATCGGCC TCCTTTGTCT 1620 

TGTTTAAAGG AATTAGCTGA GGTTTTGGCT AGGAAGTGAC AAGATATGGG CTGAAGACAT 1680 

TGTGGTCCTG ACCCTAGCAG ATCTCCCTGG GCACATCTGA CCTGGTCCAG TCAGGCAGGT 1740 

TGTCAGTTCG GGGATGGGGG CTGCTCTGCT GATTCTGTGT GTGGGTTCCC TGCAATTAGA 1800 

GTGTTCACTT GCAGGCCCCG CTCTCTTCAG AAGAGTGATG GGAAGTTCAC CAATCAGAAT 1860 

GTAGCTTTGT AGCCCAGGAA AGGACCAGAG TCCTTGAAGC GGTAGGAAAT CCCTAGGAAG 1920 

GCCCCTTAAA TACTTATGCC CAGATGAGCT GCCCTTCTTC CTATCCCCGT ATGTCGAGAG 1980 

GTTGACGAGA CAGGAAAGCC AGGAAGATGA CTCCGTGTGG CAGAAGAGAA TGGAGTCCAA 2040 

AGGGCCAACT TTCACAGAGA TTTCTGCCGC AGTTTAGCGT GGCTGTGTTC TTTCACGCGA 2100 

TGGTGACTTC GGAGAGATCA GAGGGAGATG TGCAATAGCA TGAGCCCCGC TCCTGGCCCG 2160 

GGTCCTGGAA AGGTTGTGGT TGTTTGGTGG CTTTGGCTGA TGATGTTTCC ACGCAAACAG 2220 

ATATTGCTTT CATGATGGCT GTTCTCATTT CAGTTCTGAT AATCGAGACG CTGTGCTCCC 2280 

AGGCGCTCTG CCTCCCCTTA ACTCTTCAGG AGCACCCCCT CCCCTGTAAT ACTCCTAAGT 2340 

GTATCGTGCC TCACTTACGG TTACTGCAAC ACATTTGATG GAACACACTG TCTCCTTTAA 2400

AAAAGAAAAA AAAAAAAAAA AAAA 2424

(2) INFORMATION FOR SEQ ID NO:2:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 263 amino acids
        (B) TYPE: amino acid
        (C) STRANDEDNESS: unknown
        (D) TOPOLOGY: unknown

    (ii) MOLECULE TYPE: protein

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

Met Phe Tyr Phe His Cys Pro Pro Gln Leu Glu Gly Thr Ala Pro Phe
1               5                   10                  15

Gly Asn His Ser Thr Gly Asp Phe Asp Asp Gly Phe Leu Arg Arg Lys
            20                  25                  30

Gln Arg Arg Asn Arg Thr Thr Phe Ala Leu Gln Gln Leu Glu Ala Leu
        35                  40                  45

Glu Ala Val Phe Ala Gln Thr His Tyr Pro Asp Val Phe Thr Arg Glu
    50                  55                  60

Glu Leu Ala Met Lys Ile Asn Leu Thr Glu Ala Arg Val Gln Val Trp
65                  70                  75                  80

Phe Gln Asn Arg Arg Ala Lys Trp Arg Lys Thr Glu Arg Gly Ala Ser
                85                  90                  95

Asp Gln Glu Pro Gly Ala Lys Glu Pro Met Ala Glu Val Thr Pro Pro
            100                 105                 110

Pro Val Arg Asn Ile Asn Ser Pro Pro Pro Gly Asp Gln Ala Arg Gly
        115                 120                 125

Lys Lys Glu Ala Leu Glu Ala Gln Gln Ser Leu Gly Arg Thr Val Gly
    130                 135                 140

Pro Ala Gly Pro Phe Phe Pro Ser Cys Leu Pro Gly Thr Leu Leu Asn
145                 150                 155                 160

Thr Ala Thr Tyr Ala Gln Ala Leu Ser His Val Ala Ser Leu Lys Gly
                165                 170                 175

Gly Pro Leu Cys Ser Cys Cys Val Pro Asp Pro Met Gly Leu Ser Phe
            180                 185                 190

Leu Pro Thr Tyr Gly Cys Gln Ser Asn Arg Thr Ala Ser Val Ala Ala
        195                 200                 205

Leu Arg Met Lys Ala Arg Glu His Ser Glu Ala Val Leu Gln Ser Ala
    210                 215                 220

Asn Leu Leu Pro Ser Thr Ser Ser Ser Pro Gly Pro Ala Ser Lys Gln
225                 230                 235                 240

Val Pro Pro Glu Gly Ser Gln Asp Lys Pro Ser Pro Thr Lys Glu Gln
                245                 250                 255

Ser Glu Gly Glu Lys Ser Val
            260

(2) INFORMATION FOR SEQ ID NO:3:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 70 amino acids
        (B) TYPE: amino acid
        (C) STRANDEDNESS: unknown
        (D) TOPOLOGY: unknown

    (ii) MOLECULE TYPE: protein

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

Gly Phe Leu Arg Arg Lys Gln Arg Arg Asn Arg Thr Thr Phe Ala Leu
1               5                   10                  15

Gln Gln Leu Glu Ala Leu Glu Ala Val Phe Ala Gln Thr His Tyr Pro
            20                  25                  30

Asp Val Phe Thr Arg Glu Glu Leu Ala Met Lys Ile Asn Leu Thr Glu
        35                  40                  45

Ala Arg Val Gln Val Trp Phe Gln Asn Arg Arg Ala Lys Trp Arg Lys
    50                  55                  60

Thr Glu Arg Gly Ala Ser
65                  70

(2) INFORMATION FOR SEQ ID NO:4:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 70 amino acids
        (B) TYPE: amino acid
        (C) STRANDEDNESS: unknown
        (D) TOPOLOGY: unknown

    (ii) MOLECULE TYPE: protein

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

Leu His Glu Lys Arg Lys Gln Arg Arg Ile Arg Thr Thr Phe Thr Ser
1               5                   10                  15

Ala Gln Leu Lys Glu Leu Glu Arg Val Phe Ala Glu Thr His Tyr Pro
            20                  25                  30

Asp Ile Tyr Thr Arg Glu Glu Leu Ala Leu Lys Ile Asp Leu Thr Glu
        35                  40                  45

Ala Arg Val Gln Val Trp Phe Gln Asn Arg Arg Ala Lys Phe Arg Lys






SEQUENCE VERIFICATION REPORT    DATE: 03/11/97

PATENT APPLICATION US/08/701,278 TIME: 15:45:19 

INPUT SET: S16092.raw 



Original Text 



